Casual Conversation Technology Achieving Natural Dialog with Computers
نویسندگان
چکیده
In recent years, voice recognition agents such as NTT DOCOMO’s “Shabette Concier” have become popular. ShabetteConcier is a voice agent capable of responding to task-related utterances such as “send mail” or “call,” and can answer questions such as “how high is Mount Fuji?” or “what is the highest mountain in the world?” It can also respond to casual conversation with utterances such as “I love you” or “hello.” It is highly convenient for users to be able to simply make utterances to perform a task or request particular information. Nevertheless, Shabette-Concier is not only used to enjoy these conveniences – users also talk to it using a wide range of day-to-day chat, suggesting that user desire for casual conversation is very high. However, Shabette-Concier is only able to give precise replies to utterances within the bounds of assumptions, and does not have sufficient variation in its responses. Therefore, we believe we can offer casual conversation as popular content to users and expand communication module installations in new devices such as robots, games and vehicles, and apply this technology to a range of businesses to satisfy user demand for casual conversation technologies. To respond to the user demand for casual conversation, we have developed a casual conversation system based on the technical achievements of NTT Media Intelligence Laboratories. With this system, we have aimed to enable natural conversation between computers and human beings, using utterance data created from large-scale data to generate a rich range of responses – the system does not repeat one-off utterances with users, but enables multiple and varied exchanges. This article describes an overview of the system and the dialog technology it uses.
منابع مشابه
Supporting Casual Interaction Between Intimate Collaborators
Over last decade, we have seen mounting interest in how groupware technology can support electronic interaction between intimate collaborators who are separated by time and distance. By intimate collaborators I mean small communities of friends, family or colleagues who have a real need or desire to stay in touch with one another. While there are many ways to provide electronic interaction, per...
متن کاملAnalysis of Listening-Oriented Dialogue for Building Listening Agents
Our aim is to build listening agents that can attentively listen to the user and satisfy his/her desire to speak and have himself/herself heard. This paper investigates the characteristics of such listening-oriented dialogues so that such a listening process can be achieved by automated dialogue systems. We collected both listening-oriented dialogues and casual conversation, and analyzed them b...
متن کاملDisappearing Computers, Social Actors and Embodied Agents
Presently, there are user interfaces that allow multimodal interactions. Many existing research and prototype systems introduced embodied agents, assuming that they allow a more natural conversation or dialogue between user and computer. Here we will first take a look at how in general people react to computers. We will look at some of the theories, in particular the CASA (“Computers Are Social...
متن کاملA Flexible Approach to Natural Language Generation for Disabled Children
Natural Language Generation (NLG) is a way to automatically realize a correct expression in response to a communicative goal. This technology is mainly explored in the fields of machine translation, report generation, dialog system etc. In this paper we have explored the NLG technique for another novel applicationassisting disabled children to take part in conversation. The limited physical abi...
متن کاملConversation Machines for Transaction Processing
We have built a set of integrated AI systems (called conversation machines) to enable transaction processing over the telephone for limited domains like stock trading and banking. The conversation machines integrate the state-ofthe-art technologies from computer telephony, continuous speech recognition, natural language processing and humancomputer interaction. Users can interact with these sys...
متن کامل